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Abstract 

Background: Spinosad is a macrolide antibiotic produced by Saccharopolyspora spinosa witli aerobic fermentation. 
However, tine wild strain has a low productivity. In this article, a computational guided engineering approach was 
adopted in order to improve the yield of spinosad in 5. spinosa. 

Results: Firstly, a genome-scale metabolic network reconstruction (GSMR) for S.spinosa based on its genome 
information, literature data and experimental data was extablished. The model was consists of 1,577 reactions, 1,726 
metabolites, and 733 enzymes after manually refined. Then, amino acids supplying experiments were performed in 
order to test the capabilities of the model, and the results showed a high consistency. Subsequently, transhydrogenase 
(PntAB, EC 1 .6.1 .2) was chosen as the potential target for spinosad yield improvement based on the in silico metabolic 
network models. Furthermore, the target gene was manipulated in the parent strain in order to validate the model 
predictions. At last, shake flask fermentation was carried out which led to spinosad production of 75.32 mg/L, 
86.5% higher than the parent strain (40.39 mg/L). 

Conclusions: Results confirmed the model had a high potential in engineering 5. spinosa for spinosad production. 
It is the first GSMM for S.spinosa, it has significance for a better understanding of the comprehensive metabolism 
and guiding strain designing of Saccliaropolyspora spinosa in the future. 
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Background 

Spinosad is a macrolide antibiotic produced by Sacchar- 
opolyspora spinosa with aerobic fermentation. Due to its 
unique chemical structure and mechanism, it can control 
diamondback moth; beet armyworm and other lepidop- 
teron pests effectively, while have non-toxic to mammals 
and birds [1]. Therefore spinosad has been considered to 
be the most effective biological pesticides after avermectin, 
becoming a new hot issue in the research and develop- 
ment of biological pesticides. The compounds got the 
"Presidential Green Chemistry Challenge Award" of the 
United States in 1999, because it has the chemical 



Correspondence: wenyulu@tju.edu.cn 
^Department of Biological Engineering, School of Chemical Engineering and 
Technology, Tianjin University, Tianjin 300072, PR China 
^Key Laboratory of system bioengineering (Tianjin University), Ministry of 
Education, Tianjin 300072, PR China 

Full list of author information is available at the end of the article 



pesticide s fast-acting and the biopesticide s safety, low 
residual properties. So its derivative got the award 
again in 2008 [2]. 

Spinosyns A and D, the two major components in the 
S, spinosa fermentation,are defined as spinosad [3]. It 
mainly contains a 21 -carbon tetracyclic with two deoxy- 
sugars: tri-O-methylated rhamnose and forosamine [4]. 
Waldron firstly analyzed the proposed spinosyn biosyn- 
thetic pathway [5], In recent years, the spinosad biosyn- 
thetic pathway has been clarified more accuracy: SpnA, 
spnB, spnC, spnD, and spnE responsible for type I poly- 
ketide synthase; spnF,spnJ, spnL, and spnM for modify- 
ing the polyketide synthase product [6]; spnG, spnH, 
spnl, and spnK for rhamnose attachment and methyla- 
tion [7]; spnP, spnO, spnN, spnQ, spnR, and spnS for 
forosamine biosynthesis; gtt, gdh, epi, and kre for rham- 
nose biosynthesis [8] and beside the spinosad gene 
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cluster four genes ORF-L16, ORF-Rl, and ORF-R2, have 
no effect on spinosad biosynthesis. 

The wild stain of Saccharopolyspora spinosa has low 
spinosad productivity, so lots of work has been done in 
strain modification. Three strategies have been mainly 
used in the spinosad improvement: (a) Traditional phys- 
ical and chemical mutagenesis [9]; (b) Optimization of 
fermentation process [10,11]; (c) Improving the spinosad 
production by the genetic engineering method. Pan [12] 
got a over threefold spinosad production by introducing 
the additional gtt and gdh genes under the control of 
PermE* promoter. Tang [13] increased the spinosyn pro- 
duction by 3.8-fold through overexpression of the spino- 
syn biosynthetic genes participating in the conversion of 
the cyclized polyketide to spinosyn, obtained by direct 
cloning via Red/ET recombination. Xue [14] duplicated 
s^nP, spnQ, spnN, spnQ, spnR, spnS, spnK, gtt, gdh and 
kre genes under the control of PermE* promoter, spino- 
sad production was a 5.0-fold enhancement compared 
with the wild-type S. spinosa. However, the spinosad is 
a secondary metabolite which has complex metabolic 
pathways. It is difficult to satisfy the industrial demands 
if we only pay attention to a few gene modifications. 

In recent years, more and more whole genomes have 
been sequenced with the development of high-throughput 
sequencing technology and reduction of sequencing prices. 
It is a hot topic to study how to use such a large genome 
database. The genome-scale metabolic network (GSM) 
constructed based on genome annotation information 
provide an integral level to review organism metabol- 
ism. Now Genome-Scale-Metabolic-Model (GSMM) re- 
construction technology has a great progress no matter 
in speed but quality [15,16]. At present, many articles 
has elaborated GSMM estabUshment process [17,18], 
these provided foundation to build a precision standard- 
ized GSMM. GSMM is widely used to aid strain trans- 
formation in metabolic engineering as a systems biology 
tool [19,20]. Brochado [21]used GSMM successfully pre- 
dict several genes targets in Bread yeast, which increased 
the vanillin yield about 5-fold. Huang used the GSMM 
of Streptomyces tsukubaensis to improve the FK506 pro- 
duction which led to a 1.47 fold increase [22]. The similar 
way was also used to increase the expression of the green 
fluorescent protein in lactococcus lactis [23]. GSMM re- 
construction is a time consuming work, the metabolic 
reconstruction for secondary metabolite producing still 
requires extensive manual refinement, because the lack 
of genome annotation information. 

There were no reports on the refined GSMM for Sac- 
charopolyspora spinosa. In this work, the first manually 
refined GSMM for Saccharopolyspora spinosa was estab- 
lished. The amino acid supplementation experiments were 
conducted to verify the module. Then a potential target 
gene transhydrogenase was assessed. Overexpression of 



this gene led to 86.5% higher spinosad yield than that of 
the wild stain. 

Results and discussion 

Character of the GSMM 

The first GSMM for Saccharopolyspora spinosa using a 
highthroughput protocol was constructed. The GSMM 
contained 1577 metabolites and 1736 reactions and 733 
enzymes (Additional file 1 shows the network in more 
detail). Given that the spinosad was produced alongside 
biomass, 42 reactions were required to fill the model 
gaps. Major metabolic pathways for S.spinosa, including 
the glycolytic pathway, pentose phosphate pathway (PPP), 
tricarboxylic acid (TCA) cycle, and spinosad biosynthesis, 
are illustrated in Figure 1. Additionally five subsystems 
were divided, containing: gluconeogenesis, amino acid 
synthesis pathway, fatty acid synthesis pathway, phospho- 
lipid synthesis pathways and nucleotide synthesis path- 
ways. The Character was summarized in Table 1. 

Amino acid addition in silico simulation and experiments 

To validate the spinosad biosynthetic pathway, we used 
the GSMR model to predict the potential benefit of sup- 
plementing the media with individual amino acid (AA) 
in terms of spinosad production. Eight AAs (Leu, Gly, 
Met, Ala, Asp, Cys, Lys, Pro) were individually added to 
study the effect on spinosad production. The experiments 
and in silico simulation results were showed respectively 
in Figures 2B and A. The experiment dates showed that 
Gly was the best amino acid to spinosad production 
and following Leu, Ala. Pro had a negative effect on 
spinosad production. The remaining four amino acids 
(Asp, Met, Cys, Lys) displayed no significant effects on 
spinosad production. 

Amino acids play an important role in antibiotic syn- 
thesis and cell growth in the fermentation process as 
important biological small molecules. AA can directly 
participate in the biosynthesis of antibiotics as the pre- 
cursor [24,25]; AA or its peptide structure influence the 
biosynthesis of antibiotics by regulating the activity of 
the enzymes in the pathway as activators or inhibitors 
and its hydrolysates can change microbial growth envir- 
onment by altering the PH in the medium. The main 
product in glycine metabolism is acetyl-COA which is 
the precursor of spinosad [26], this may explain the 
results in our work. Hinnebusch [27] proved that the 
GCN4 gene which encodes a positive regulator of un- 
linked amino acid biosynthetic genes in yeast is itself 
regulated by amino acid availability and that the regula- 
tion occurs at the translational level. Proline inhibit the 
spinosad synthesis may due to it or its metabolites par- 
ticipate in the regulation process. The GSMM in our 
work has no regulation systems, so the result was not 
consistent with the experiments. 
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Figure 1 Schematic diagram of metabolic pathiways for S.spinosa. The NADH/NADPH generation and consumption are sliowed in red. Tlie 
enlianced patliway is sliowed in blue (solid line represents gene overexpression, dotted line means possible enhanced pathways). The weakened 
pathway is showed in purple. 



Impact of PntAB gene on spinosad synthesis 

Spinosad is a secondary metabolite. Energy and reducing 
power NADPH were required in its synthesis process. 
Cellular reducing power level, such as NADPH/NADP"^ 
and NADH/NAD^, has an important role in the metabolic 
regulation in the cell. Therefore the reaction R00112 

Table 1 Network summary for S.spinosa GSMM 



Metabolic network characteristics Total 

Metabolites 1737 

Reactions 1577 

Intracellular reactions 1 377 (87.3%) 

Transport reactions 105 (6.6%) 

Exchange reactions 95 (6.1%) 

Reactions added filled by manual curation 42 

Enzymes 733 



(NADPH + NAD"" = NADP"" + NADH) was used as control 
variables, the spinosad synthesis reaction as a target re- 
action to study the effect of transhydrogenase activity 
on spinosad synthesis. The results were exhibited in 
Figure 3A. 

In the Figure 2A, when the reaction R00112 flux is 0, 
the spinosad synthesis rate didn't reach the peak. When 
the reaction flux is positive (NADPH NADH), the spi- 
nosad synthesis rate decreased. In contrast, the spinosad 
synthesis rate climb up and then decline when the the 
reaction flux is negative. Further conclude can be ob- 
tained, spinosad was produced in the stationary phase, 
at this phase, transhydrogenase in the cell kept the bal- 
ance of the reducing power(NADPH/NADH), but the 
balance point was not the best for spinosad synthesis. In 
order to prove the simulation results, pntAB was over 
expressed, the yield of spinosad in S.spinosa 26 IP in- 
creased to 37.9 mg/L which was little higher than the 



Wang et al. Microbial Cell Factories 2014, 13:41 
http://www.nnicrobialcellfactories.conn/content/1 3/1/41 



Page 4 of 7 




wild strain(34.4 mg/L). Then 0.1% rhamnose was added, 
the yield increased to 75.32 mg/L which was 0.86-fold 
higher than that in the wild-type S. spinosa (40.39 mg/L). 

Efficient regeneration of NADPH which mainly comes 
from PPP and TCA pathways plays an important role 
in biotransformation processes. The biotransformation 
is often limited by cofactor regeneration because cells 
would rather grow better than intend to sustain the 
biotransformation [28,29]. The NADPH regeneration 
system needs to be enhanced in order to satisfy bio- 
transformation. Three ways were modified to increase 
the NADPH level in the cell including oxidative part of 
the pentose phosphate pathway (PPP) [30,31], the tricarb- 
oxylic acid cycle (TCA) [32], and the transhydrogenases 



system. Overexpression of pntAB in E. coli got a 3.5-fold 
conversion of acetophenone to (R)-phenylethanol [33]. 
The similar way was conducted in Corynebacterium gluta- 
micum to get a high yield of L-lysine [34]. Bastian [35] ob- 
tained a 6.5-fold enhancement of isobutanol production 
under anaerobic conditions in E. coli by overexpression of 
pntAB, In our work, two reasons may explain the results 
that the engineered strain has a low production in general 
fermentation medium: Firstly, spinosad is a secondary me- 
tabolite and it has a low synthesis rate; Secondly, NADPH 
was mainly used in biomass synthesis. 

Rhamnose is both the precursor of spinosad and the 
cell wall ingredient, but the Saccharopolyspora spinosa 
only have one set of rhamonose gene [8,36]. The yield of 




Flux of transhydrogenase(mmol/gDW*h) 

Figure 3 The effect of reducing power level on spinosad synthesis. A: Impact of transhydrogenase activity on spinosad syntliesis {in silico) B: 
Tine experimental results of shake flask fermentation. 
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spinosad had a 3.5-fold enhancement by dupUcation of 
the rhamnose biosynthetic genes {gtt and gdh) in the 
previous study [14]. So the rhamnose plays an important 
role in spinosad synthesis. The rhamnose in the fermenta- 
tion medium can accelerate the use of pseudoaglycones 
(PSA). Then PSA synthesis made more NADPH flow into 
spinosad synthesis. Reducing power balance changed by 
over expressing pntAB, more NADPH was produced from 
NADH. When the rhamnose was added, more NADPH 
flowed into spinosad synthesis pathway, so the engineered 
strain had a high productivity in rhamnose medium. 

Conclusions 

In this work, the first Genome-scale metabolic network 
of Saccharopolyspora spinosa was constructed following 
a systematic workflow. Then it was validated by amino 
acids supplementation experiments. Finally based on the 
GSMM, PntAB was identified as the target gene. The 
strain was engineered according to the predicted target. 
Fermentation characterization of the engineered strain 
showed the improved capacities of spinosad production. 
The final yield of spinosad is 75.32 mg/L which is in- 
creased by 86.5% than the parental strain. Our results show 
the genome-scale metabolic network model is a powerful 
tool to guide strain engineering towards improved bio- 
production in S.spinosa, as well as other microorganisms. 



Only one form of the multi-step reaction was remained 
in the model. For example, the overall reaction R004385 
was kept while its sub-step reactions R00174 were re- 
moved; At last, reactions' reversibility and cofactors were 
determined and verified. 

At the last stage, spontaneous reactions, exchange re- 
actions, transport reactions, biomass reactions and spi- 
nosad synthesis reactions were added to get a complete 
GSMM. Biomass formation was mainly based on the 
chemicals of the cell. The reactions containing the me- 
tabolites which were verified to be taken up from the 
medium and the metabolite which can diffuse through 
the membranes were added as the transport reaction. 
Exchange reactions were added for all extracellular me- 
tabolites, which represented the system boundaries. 

Biomass equation 

The biomass equation information was mainly got from 
available literature date. It is divided into proteins, RNA, 
DNA, lipid cell wall peptidoglycanpolysaccharide mura- 
mic acid. DNA and RNA composition were determined 
based on genomic data from the Saccharopolyspora 
spinosa genome. Protein composition was determined 
based on open reading frames (ORFs). Other components 
were adapted from S, coelicolor [39]. The biomass com- 
position is described in detail in the Additional file 2. 



Materials and methods 

Genome-scale metabolic network reconstruction 

The genome of Saccharopolyspora spinosa was sequenced 
and annotated in 2011 by Yuanlong Pan [37], the results 
can be got in NCBI [38]. The complete sequence of the 
8.809 Mb Saccharopolyspora spinosa NRRL 18395 gen- 
ome has a 67.94% GC content. The model reconstruction 
followed the standard protocols described previously [17]. 
The process of the genome-scale metabolic network 
reconstruction for the Saccharopolyspora spinosa can 
be described as follows: 

Firstly, a draft reconstruction was created. The necessary 
information, containing genome annotation information 
and the biochemistry information of the enzymes, can be 
searched in KEGG and NCBI. The following factors were 
included in the draft: (1) the basic reaction information 
(ID, name, equation and reversibility), (2) the enzymes and 
genes relevant to particular reaction. (3) the subsystems. 
All the information above was saved in an Excel which 
contains 1830 reactions and 1849 metabolites. 

Secondly, refine the draft model got above. Many reac- 
tions were redundant in the draft while some necessary 
ones were absent, so a lot of work was done in the refine 
process. The reactions and metabolites IDs may different 
in different database, only one ID was remained; The re- 
actions which synthesize or modify the macromolecules 
were replaced by the biomass formulation reactions; 



Constraint-Based Flux Analysis 

Constraint-Based Flux Analysis (FBA) has a wildly use in 
flux distribution calculations [40,41]. The resulting model 
was analyzed using Constraint-Based Reconstruction and 
Analysis (COBRA). Robustness prediction was performed 
using the COBRAToolbox-2.0 in MATLAB, with GLPK 
and CPLEX as the optimization programming solvers 
[42]. The stoichiometric matrix (S) and reaction flux 
constrains were extracted from the SBML file. At the 

Table 2 The strains and plasmid used in this study 

Strain or Description Source or 

plasmids reference 

Strains 



E. coil DH5a 

E. CO// ET- 12567 

S.spinoso ATCC 49460 
S.spinoso 261 P 

Plasmid 

pOJ260 

pOJ261 
pOJ261P 



Host for general cloning TransGen 

biotech 

Donor stain for conjugation [44] 
between £ coli and S. spinosa 

Wild strain ATCC 

S.spinoso ATCC 49460 This study 
harboring pOJ261 P 



E co//- Streptomcyes shuttle [45] 
vector; apr oriT repPUC lacZ 

pOJ260 with ermE* promoter Our lab 

pOJ260 with ermE*-controlled This study 
PntAB 
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model curation process, FBA was used to fill the gaps until 
the model can use the nature medium to synthesize bio- 
mass and spinosad. Robustness [43] analysis is an algo- 
rithm based on FBA, the flux through a selected reaction 
is varied and the optimal objective value is calculated as a 
function of this flux. This method can be used to simulate 
the influence of the biological metabolic state caused by 
internal or external disturbance. 

Bacterial strains, plasmids, media and culture conditions 

Escherichia coli DH5a was used for all plasmid construc- 
tions and amplification. E, coli ET- 12567 was used as the 
door strain in biparental intergeneric conjugations. S, spi- 
nosa ATCC49460 was used as the parent strain. The 
pOJ261, which is pOJ260 with ermE* promoter, was 
used as expression plasmid. The ermE* promoter was 
digested from pIB139. Plasmids and stains used in this 
study are summarized in Table 2. 

E, coli strains were cultured in Luria-Bertani (LB) 
medium at 37°C. The slant and seed medium contained 
(g/L): trypticase soy broth, 30; yeast extract, 3; MgSO^ ♦ 
7H^O, 2; glucose, 10; and maltose, 4, pH 7.2. In the 
slant medium 20 g/L agar was added. The fermentation 
medium contained (g/L): glucose, 68; cottonseed flour, 
22; peptone C, 25; corn seed liquor, 14.5; methyloleate, 
40; and CaC03, 5, pH 7.2. ABB13 medium contained 
(g/L): soy peptone, 5; soluble starch; CaCOs, 3; MOPS, 
2.1; agar 20. 

Strains were cultured on a 220 rpm rotary shaker at 
29°C in a 250 mL flask containing 30 mL of seed medium 
for 3 days. Then 3 mL of seed medium was injected into 
30 mL fermentation media in a 250 mL flask. Strains 
were cultured for 9 days at 29°C on a 220 rpm rotary 
shaker. The fermentations were run in triplicate [14]. 

Amino acids addition 

After 72 h fermentation, 0.05% of amino acids (Leu, Gly, 
Met, Ala, Asp, Cys, Lys, Pro) Standards was added by 
0.22 (i filter membrane filtration. 

Gene cloning, plasmid construction and transformation 

General DNA manipulation was performed according to 
the standard protocols [46]. PntAB was amplified from 
genomic DNA of S, spinosa using primer pairs of F-CC 
TArcrAGACGCCGGACAAGGACGl-XbaI,R-GGATCC 
ArArGACCTCTCCGGAGAGC-NdeI(The itaUcs parts 
represent the restriction enzyme cutting site). The PGR 
product of PntAB was digested by Ndel and Xbal. Then, 
it was cloned into pOJ261 which was also digested by 
Ndel and Xbal to get pOJ261P. 

The constructed plasmid was introduced into S. spinosa 
strains by conjugation from E. coli ET12567 and homolo- 
gous recombination into the chromosome as described 
as [44]. The recombination stain S, spinosa 261P was 



confirmed by PGR amplification with vector primers 
V-F:GGGTGATTTTGTAGGGGTGG and V-R: GGCC 
TAGTTGA GGTATGGTGG, and the result was positive. 

Analysis of spinosad biosynthesis in fermentation cultures 

Spinosad in the fermentation broth was determined as 
described [10]. Every level in the experiment was re- 
peated three times, so the figures in the article were the 
mean value of the experiment. Standard deviations were 
calculated from triplicate flasks. 

Additional files 



Additional file 1: The SPNV1.0 model content. 

Additional file 2: Biomass composition of Saccharopolyspora spinosa. 
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